SI f . 



PATENT ABSTRACTS OF JAPAN 

(1 1 )Publication number : 2001 -265808 

(43)Date of publication of application : 28.09.2001 

(51)Int.CI. G06F 17/30 



(21) Application number : 2000-081037 (71)Applicant : SKYSOFT INC 

(22) Date of filing : 22.03.2000 (72)Inventor : HIROOKA YASUO 

TERANO TAKAO 
OTSUKA YUKICHI 



(54) SYSTEM AND METHOD FOR INFORMATION RETRIEVAL 

(57)Abstract: 

PROBLEM TO BE SOLVED: To search for and recommend a book which is both 

useful and unexpected to a customer. 

SOLUTION: Key word vectors of respective books that the purchaser bought in the 
past are put together by categories to generate a customer profile. According to a 
book database, the key word vectors are generated for each book. Then the similarity 
between the key word vectors of the customer profile and the key word vectors of 
unpurchased books is computed by combinations of books in the same category and 
between different categories, and the a book can be retrieved from the point of view 
of usefulness and unexpectedness. Thus, an unpurchased book with high similarity can 
be recommended. 
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CLAIMS 



[Claim(s)] 

[Claim 1] A classification means to be the information retrieval system with which the 
contents which a user desires out of the information space containing much contents 
are searched, and to classify each contents on said information space into the 
contents category of a predetermined number, A feature-vector generation means to 
generate the feature vector of each contents on said information space, A user 
profile generation means to generate the user profile containing the feature vector of 
the contents in which the user did past access, Information retrieval system 
characterized by providing a vector comparison means to compare the feature vector 
of a user profile and contents for every contents category, and a contents 
specification means to specify the contents which a user desires based on the result 
of said comparison. 

[Claim 2] It is the information retrieval system according to claim 1 characterized by 
generating a keyword vector with the coordinate value which said contents are the 
contents containing text documents, such as books, and hung the weighting factor at 
the frequency of occurrence of each keyword in the inside of contents on N 
dimension keyword space where said feature-vector generation means assigned the 
axis of coordinates to each of the keyword of N individual. 

[Claim 3] Said user profile generation means is information retrieval system according 
to claim 1 characterized by generating the user profile which a user compounds the 
feature vector of each contents which carried out past access for every contents 
category, and becomes from the feature vector for every contents category, 
[Claim 4] It is the information retrieval system according to claim 1 which said user 
profile generation means generates the user profile which consists of a feature vector 
for every contents category, and is characterized by said vector comparison means 
comparing the feature vector of a user profile with the feature vector of contents 
within the same category. 

[Claim 5] It is the information retrieval system according to claim 1 which said user 

profile generation means generates the user profile which consists of a feature vector 
for every contents category, and is characterized by said vector comparison means 



comparing the feature vector of a user profile with the feature vector of contents 
between different categories. 

[Claim 6] Said contents specification means is information retrieval system according 
to claim 1 characterized by specifying as contents which a user expects the high 
contents of a user profile and similarity. 

[Claim 7] Said contents specification means is information retrieval system according 
to claim 1 characterized by a user specifying among the high contents of a user 
profile and similarity as contents which a user expects a non-accessed thing. 
[Claim 8] The feature-vector generation step which is the information retrieval 
approach of searching the contents which a user desires out of the information space 
containing much contents, and generates the feature vector of each contents on said 
information space, The classification step which classifies each contents on said 
information space into the contents category of a predetermined number, The user 
profile generation step which generates the user profile containing the feature vector 
of the contents in which the user did past access. The information retrieval approach 
characterized by providing the vector comparison step which compares the feature 
vector of a user profile and contents for every contents category, and the contents 
specification step which specifies the contents which a user desires based on the 
result of said comparison. 

[Claim 9] said contents be the information retrieval approaches according to claim 8 
which be the contents containing text documents , such as books , and be 
characterize by generate a keyword vector with the coordinate value which hung the 
weighting factor at the frequency of occurrence of each keyword in the inside of 
contents on N dimension keyword space which assigned the axis of coordinates to 
each of the keyword of N individual at said feature vector generation step . 
[Claim 1 0] The information retrieval approach according to claim 8 characterized by 
generating the user profile which a user compounds the feature vector of each 
contents which carried out past access for every contents category, and becomes 
from the feature vector for every contents category at said user profile generation 
step. 

[Claim 11] The information retrieval approach according to claim 8 which generates 
the user profile which consists of a feature vector for every contents category at said 
user profile generation step, and is characterized by comparing the feature vector of 
a user profile with the feature vector of contents within the same category at said 
vector comparison step. 

[Claim 12] It is the information retrieval approach according to claim 8 which 
generates the user profile which consists of a feature vector for every contents 
category at said user profile generation step, and is characterized by said vector 
comparison step comparing the feature vector of a user profile with the feature 
vector of contents between different categories. 

[Claim 1 3] The information retrieval approach according to claim 8 characterized by 



specifying as contents which a user expects the high contents of a user profile and 
similarity at said contents specification step. 

[Claim 14] The information retrieval approach according to claim 8 characterized by a 
user specifying among the high contents of a user profile and similarity at said 
contents specification step as contents which a user expects a non-accessed thing. 
[Claim 15] It is the software storage which stores the computer software for making 
the information retrieval processing which searches the contents which a user desires 
out of the information space containing much contents perform on computer system 
in a computer-readable format. The feature-vector generation step to which said 
computer software generates the feature vector of each contents on said information 
space, The classification step which classifies each contents on said information 
space into the contents category of a predetermined number. The user profile 
generation step which generates the user profile containing the feature vector of the 
contents in which the user did past access. The software storage characterized by 
providing the vector comparison step which compares the feature vector of a user 
profile and contents for every contents category, and the contents specification step 
which specifies the contents which a user desires based on the result of said 
comparison. 

[Claim 1 6] Said contents be software storages according to claim 1 5 which be the 
contents containing text documents , such as books , and be characterize by 
generate a keyword vector with the coordinate value which hung the weighting factor 
at the frequency of occurrence of each keyword in the inside of contents on N 
dimension keyword space which assigned the axis of coordinates to each of the 
keyword of N individual at said feature vector generation step . 
[Claim 17] The software storage according to claim 15 characterized by generating 
the user profile which a user compounds the feature vector of each contents which 
carried out past access for every contents category, and becomes from the feature 
vector for every contents category at said user profile generation step. 
[Claim 18] The software storage according to claim 15 which generates the user 
profile which consists of a feature vector for every contents category at said user 
profile generation step, and is characterized by comparing the feature vector of a 
user profile with the feature vector of contents within the same category at said 
vector comparison step. 

[Claim 19] It is the software storage according to claim 15 which generates the user 
profile which consists of a feature vector for every contents category at said user 
profile generation step, and is characterized by said vector comparison step 
comparing the feature vector of a user profile with the feature vector of contents 

between different categories. 

[Claim 20] The software storage according to claim 15 characterized by specifying as 
contents which a user expects the high contents of a user profile and similarity at 
said contents specification step. 



[Claim 21] The software storage according to claim 15 characterized by a user 
specifying among the high contents of a user profile and similarity at said contents 
specification step as contents which a user expects a non-accessed thing. 



DETAILED DESCRIPTION 



[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] This invention relates to the information retrieval system and 
the approach for discovering the contents which agreed in the demand of a user out 
of huge information contents, and relates to the information retrieval system and the 
approach of searching for desired contents especially based on user profile 
information, such as a hobby and taste of a user, and the past retrieval hysteresis. 
[0002] Furthermore, in order RIKOMENDO [ the goods which a customer desires 
when this invention performs business, such as a goods sale, on the Internet in detail 
/ discover and ], available information retrieval system and an available approach are 
started, and it is especially related with the information retrieval system and the 
approach which are applied in goods sale business which asks for the goods with 
which a customer has benefit and unexpected nature like a books sale. 
[0003] 

[Description of the Prior Art] It has become possible with improvement in the 
technology of the semi-conductor manufacture and others in these days to offer 
comparatively cheap computing systems including a personal computer (PC) in a 
commercial scene. A computing system is spreading extensively until it reaches the 
office and the ordinary homes of an enterprise or research facilities. Furthermore, the 
want to the network computing technique which interconnects in computer systems 
has also been mounting. On a network, computer resources, such as a file and a 
printer, can be shared between each users, or the collaboration-activity of 
informational share, negotiation, distribution, exchange, etc. can be done smoothly. 
[0004] The gestalt of the network which interconnects in computers is various. For 
example, LAN (Local Area Network) laid by part like Ethernet (trademark) or a token 
ring, WAN (Wide Area Network) which consists of dedicated lines etc, by 
interconnecting in LANs, the "Internet" (The Internet) which accomplished growth to 
the network of worldwide magnitude literally as a result of having repeated 
interconnect of networks and having performed it further, etc. are various. 
[0005] The Internet came to grow up to be a huge network, as a result of the server 
installed in each university, research facilities, etc. repeating interconnect 
independently. The servers on the Internet usually interconnect with the TCP/IP 



(Transmission Control Protocol/Internet Protocol) base. 

[0006] On the Internet, service of a large number which operate on TCP/IP networks, 
such as WWW (World Wide Web), News, and TELNET (TELetypewriter NETwork), FTP 
(File Transfer Protocol), Gopher, is exhibited. Especially WWW is broader-based 
information retrieval system which offers the information space of hyperlink structure, 
and has also become the greatest factor which accomplishes the explosive growth 
and the rapid spread of the Internet. 

[0007] WWW is built with a client-server mold. A WWW server is computer system 
which works the server software called a HTTP demon, and stores the information 
resource described in the HTML format, and/or offers this resource. A WWW server 
exhibits information resources including a HTML document by onerous or onerous on 
the Internet. If TCP/IP is followed, these information resources are specified by the 
identifier of the format of URL (Uniform Resource Locator). Although a HTML 
document can include link relation with other information resources, a link information 
is also specified in a URL format. 

[0008] The WWW client of another side is a computer which operates using the user 
agent called a "WWW browser" or a "HTML browser." "Netscape Navigator" of U.S. 
Netscape Communications and "Internet Explorer" of U.S. Microsoft are the examples 
of representation of a WWW browser. 

[0009] A WWW client user can specify a desired information resource by entering the 
URL character string as a resource identifier in the location box on a browser screen. 
A WWW browser usually performs access to the information resource on a WWW 
server, i.e., the ejection of an information resource, according to a HTTP (Hyper Text 
Transfer Protocol) protocol. Furthermore, a WWW browser analyzes the acquired 
HTML document and displays the content on a browser window screen (that is, a 
homepage is assembled on a browser screen). On the WWW page displayed on the 
browser screen, a link information is displayed as "support." By clicking support, a 
user can do sequential assignment of the information resource which is the following 
reference place. That is, a user can search the WWW information-resource space 
around which the hyperlink was spread on the Internet through a WWW browser 
screen, and can access various information resources without a joint one after 
another (common knowledge). WWW is equipped with the advanced user interface 
which can perform information retrieval by the easy actuation by the dialogue input. 
[0010] The diffusion rate of the Internet is high and has already permeated deeply 
various research facilities, such as a university, the office of an enterprise, ordinary 
homes, etc. Current and the Internet are used as a means of not only mere 
information distribution but various business. They are the commercial distribution 
and the sale system called the so-called "internet business." 

[0011] For example, digital information, such as computer software, and music, image 
contents, can be distributed via the Internet. In such a case, most of the distribution 
channel and sale way of the goods needed in case it sells by archive media, such as 



CD and DVD, is omissible. 

[0012] Moreover, when internet business is applied to the conventional mail order, 
goods information can be momentarily distributed all over the world only by uploading 
the data contents equivalent to a catalog on one website. Consequently, the time and 
effort and cost which distribute or mail print media, such as a throwaway and a 
catalog, can be saved. There is instancy nature of goods information and commitment 
of goods can be concluded smoothly and promptly. 

[0013] By distributing goods information on the Internet, the consumer or commodity 
demand buried in the remote place can be unearthed comparatively easily. For a 
goods vendor side, even if it is little goods with which the shipment high to the extent 
that it arranges to a shelf space is not expected, it can sell at pinpoint to the 
consumer who are scattered in various places. Moreover, if it carries out from a 
consuming public side, even if it is goods which are not dealt with at a nearby firm but 
have feeling of a comparatively high price, such as an importation and a foreign book, 
a supplier company can be discovered easily and it can purchase by low cost. In other 
words, the threshold of a store becomes markedly low. 

[0014] Internet business is excellent in informational instancy nature. Moreover, a 
physical means like the showcase for a goods sale as other advantages is hardly 
needed, but it is mentioned that the initial cost which a goods sale and sales 
promotion take is low. By development of internet business, a consumer's position 
becomes strong and the width of face of goods selection is said to have been 
extended greatly, furthermore, the thing of a tariff which it pays and is alike and is 
adopted for secure electronic banking — a goods sale — all processes can almost be 
mechanized and automated by remote operation via a network. 

[0015] Fundamentally in internet business, it can trade through being interactive and 
an advanced user interface like WWW. Moreover, in the enterprise which develops 
internet business, it aims at PASONA rise-ization which customizes contents and 
service for every customer taking advantage of the point of being interactively 
connected to the customer. 

[0016] However, while it is realizable with the existing technique in WWW or an 
electronic mail, it is not easy to perform the response according to individual for 
every customer to prepare suitable contents for every customer, or to recommend. 
To such a technical problem, the "RIKOMENDA system" which generates 
recommendation information automatically is expected. 

[0017] For example, while being able to offer the new title according to liking to each 
reader and expanding customer service by applying a RIKOMENDA system to 
business which performs a books sale of a foreign book etc. on the Internet, it leads 

also to buildup of sales. 

[0018] Two methods called a "contents analysis method" and a "coordination filtering 
method" exist in the RIKOMENDA system conventionally. 

[0019] A contents analysis method is the approach, RIKOMENDO [ choose near 



contents by a user s interest and ], by comparing the user profile which is the 
description expression of contents and user interest expression used as a 
RIKOMENDO object. When applying to a foreign-book sale, the description expression 
of contents is accumulated as a books database which consists of the title of each 
books, an author name, a genre, a keyword, etc. Moreover, a user profile consists of 
the books purchasing hysteresis and retrieval hysteresis of the past besides profiles, 
such as a user's hobby and special ability. 

[0020] For contents analysis, the common knowledge technique of the information 
retrieval field is applicable. For example, vector-space model by the term with weight 
[1] Tools of analysis which compare the feature vectors about contents like retrieval 
of a mold can be used. That is, it searches for. the vector, i.e., a "keyword vector", 
which has the coordinate value which hung the weighting factor on the count of an 
appearance of each keyword in the inside of contents on N dimension space which 
assigned the axis of coordinates to each of the keyword of N individual, and desired 
contents are searched based on the similarity of keyword vectors. It can ask for 
similarity based on the inner product and distance of a vector. 
[0021] However, two next points are pointed out as a fault of a contents analysis 
method [2]. 

(1) The contents made applicable to retrieval are dependent on contents analytical 
skill. In the actual condition, it is restricted to the text document which can apply an 
information retrieval technique. 

[0022] (2) There is an inclination only for information similar to the information which 

the user esteemed to be extracted. 

[0023] When using information retrieval for RIKOMENDESHON of books, such as a 
foreign book, especially, only the things of the books purchased before and a similar 
content are not always liked. First of all, since reading is an act based on intellectual 
desire, it is desirable there is not only benefit, such as to agree to hobby and taste of 
a user (reader), but its RIKOMENDO [ books with unexpected nature or freshness ]. If 
RIKOMENDESHON is repeated only in about the books of a similar content, a 
customer will get bored soon. Therefore, the fault of the above (2) which a contents 
analysis method has may become fatal when applying to books sale business. 
[0024] Moreover, a coordination filtering method is a method which generates 
RIKOMENDO information by analyzing the contents assessment information between 
two or more users, without performing contents analysis. First, grouping of the 
associate with a similar interest is carried out by comparing the contents assessment 
information between users. And RIKOMENDO [ the thing whose oneself is not 
evaluated ] among the contents which his associate esteemed. The coordination 
filtering method is already adopted in some commercial solutions represented by 
NetPerceptions [3] and [4]. 

[0025] In order not to perform contents analysis, a coordination filtering method has 
the advantage that there is no limit in the target contents, and can be applied also to 



almost all media other than a text document. On the other hand, four next points are 
pointed out as a fault of a coordination filtering method [2]. 

[0026] (1) The sufficient number of users and an assessment information number are 
needed to the amount of object contents. When the numbers of users run short, 
coverage becomes narrow and it becomes impossible to make a RIKOMENDO object 
only the part restricted among all contents. Moreover, a user with a unique taste is 
anxious about RIKOMENDO itself becoming impossible etc. as a result in which an 
associate is not found. 

[0027] (2) Even if new contents are registered, it does not become a RIKOMENDO 
object until someone evaluates. 

[0028] (3) Since contents analysis is not performed, although the content is similar, 
assessment of the contents from which ID differs cannot make high users an 
associate. 

[0029] (4) It is difficult for the incorrect input of contents assessment to eliminate 
the effect which it has on RIKOMENDO to other users. 

[0030] When RIKOMENDO service is considered for books, fundamentally, both a 
contents method and a coordination filtering method can be applied. However, since 
an all directions type has a fault, respectively as mentioned above, we are anxious 
about the following point. 

[0031] (1) Possibility that the books [ RIKOMENDO / books ] will become only things 
similar to the books which the customer purchased in the past by the contents 
analysis method is high. If RIKOMENDESHON is repeated only in about the books of a 

similar content, a customer will get bored soon. 

[0032] (2) By books sale, since the target number of contents is huge, if the number 
of customers set as the object of RIKOMENDO service is not considerable magnitude, 
either, high quality service is not expectable with a coordination filtering method. 
[0033] 

[Problem(s) to be Solved by the Invention] The object of this invention is to offer the 
outstanding information retrieval system and the outstanding approach of 
RIKOMENDO [ the contents corresponding to a demand of a user / discover and ] 
out of huge information contents. 

[0034] The further object of this invention is to offer the outstanding information 
retrieval system and the outstanding approach of desired contents that it can search 
and RIKOMENDO can be performed, based on user profile information, such as a 
hobby and taste of a user, and past retrieval hysteresis, purchasing hysteresis. 
[0035] The further object of this invention is to offer the outstanding information 
retrieval system and the outstanding approach of RIKOMENDO [ the goods which a 
customer expects / discover and ], when performing business, such as a goods sale, 
on the Internet. 

[0036] The further object of this invention is to offer the outstanding information 
retrieval system and the outstanding approach of RIKOMENDO [ goods with which a 



customer asks for the unexpected nature other than benefit / discover and ] like a 

books sale. 

[0037] 

[Means for Solving the Problem] This invention is made in consideration of the 
above-mentioned technical problem. The 1 st side face The classification means or 
step which is the information retrieval system or the approach of searching the 
contents which a user desires out of the information space containing much contents, 
and classifies each contents on said information space into the contents category of 
a predetermined number, The feature-vector generation means or step which 
generates the feature vector of each contents on said information space, The 
classification means or step which classifies each contents on said information space 
into the contents category of a predetermined number, The user profile generation 
means or step which generates the user profile containing the feature vector of the 
contents in which the user did past access. The vector comparison means or step 
which compares the feature vector of a user profile and contents for every contents 
category, It is the information retrieval system or the approach characterized by 
providing the contents specification means or step which specifies the contents 
which a user desires based on the result of said comparison. 

[0038] However, in the information retrieval system or the approach concerning this 
invention, when dealing with the goods with which the contents category is 
beforehand specified like books as contents for retrieval, an above-mentioned 
classification means or an above-mentioned step may be skipped, and the existing 
contents category may be used as it is. An example of a books category is "BISAC 
Subject Category" which BISAC (Booklndustry Systems Advisory Committee) offers. 
[0039] In the information retrieval system or the approach concerning the 1 st side 
face of this invention, said contents may be the contents containing text documents, 
such as books. In such a case, you may make it said feature~vector generation means 
or step generate the keyword vector which has the coordinate value which hung the 
weighting factor on the frequency of occurrence of each keyword in the inside of 
contents on N dimension keyword space which assigned the axis of coordinates to 
each of the keyword of N individual. 

[0040] Moreover, said user profile generation means or step compounds the feature 
vector of each contents in which the user did past access for every contents 
category, and you may make it generate the user profile which consists of a feature 
vector for every contents category. 

[0041] Moreover, in said user profile generation means or step, when generating the 
user profile which consists of a feature vector for every contents category, even if 
like [ said vector comparison means or step / compare the feature vector of a user 
profile with the feature vector of contents and ] (order matching) within the same 
category, it is good. Or even if like [ said vector comparison means or step / compare 
the feature vector of a user profile with the feature vector of contents and ] 



(crossover matching) between different categories, it is good. 

[0042] Moreover, said contents specification means or step is specified as contents 
which a user expects the high contents of a user profile and similarity, and it may be 
made RIKOMENDO [ it / a user ]. 

[0043] Moreover, among the high contents of a user profile and similarity, a user 
specifies said contents specification means or step as contents which a user expects 
a non-accessed thing, and may be made RIKOMENDO [ it / a user ]. 
[0044] Moreover, the 2nd side face of this invention It is the software storage which 
stores the computer software for making the information retrieval processing which 
searches the contents which a user desires out of the information space containing 
much contents perform on computer system in a computer-readable format. The 
feature-vector generation step to which said computer software generates the 
feature vector of each contents on said information space, The classification step 
which classifies each contents on said information space into the contents category 
of a predetermined number, The user profile generation step which generates the user 
profile containing the feature vector of the contents in which the user did past 
access, The vector comparison step which compares the feature vector of a user 
profile and contents for every contents category, It is the software storage 
characterized by providing the contents specification step which specifies the 
contents which a user desires based on the result of said comparison. 
[0045] 

[Function] The information retrieval system and the approach concerning this 
invention are applicable suitable for retrieval, RIKOMENDESHON, etc. of contents 
which consist of text documents, such as books. 

[0046] By the information retrieval system and the approach concerning this invention, 
a customer compounds the feature vector of each books purchased in the past 
according to a category, and generates the customer profile which consists of a 
feature vector for every category. Moreover, a feature vector is generated for every 
books by performing keyword vector-space configuration processing to each books 
based on a books database. 

[0047] And RIKOMENDO [ the similarity of the feature vector of a customer profile 
and the feature vector which each non-purchased books have is calculated, and / the 
high non-purchased books of similarity ]. Books can be searched with performing 
similarity count of a feature vector in each combination between categories which are 
different in a list (crossover matching) from a viewpoint of the both sides of benefit 
and unexpected nature between the same categories (order matching). 
[0048] The feature vector dealt with in this invention is good by the "keyword 
vector" expressing the frequency of occurrence of the keyword which are scattered 
in contents, such as for example, a text document. A keyword vector is a vector 
which has the coordinate value which hung the weighting factor on the count of an 
appearance of each keyword in the inside of contents on N dimension keyword space 



which assigned the axis of coordinates to each of the keyword of N individual. 
[0049] Although the information retrieval system and the approach concerning this 
invention make a contents analysis method the keynote, they can cancel the fault of 
a contents analysis method by carrying out the device which expands the width of 
face of RIKOMENDO information. Moreover, also when customer magnitude is small, 
offer of high quality RIKOMENDO service can be enabled because the number of 
object contents applies this invention in a very big type of industry like a books sale. 
[0050] The software storage concerning the 2nd side face of this invention is a 
medium which offers computer software in a computer-readable format to the general 
purpose computer system which can perform various program codes, for example. 
Attachment and detachment of CD (Compact Disc), FD (Floppy Disc), MO (Magneto- 
Optical disc), etc., etc. are free for such a medium, and it is a storage of portability. 
[0051] Such a software storage defines the collaboration-relation on the structure of 
the computer software and the storage for realizing the function of computer 
software predetermined in a computer system top, or a function. If it puts in another 
way. by installing predetermined computer software in computer system through the 
software storage concerning the 2nd side face of this invention, on computer system, 
a collaboration-operation is demonstrated and the same operation effectiveness as 
the information retrieval system or the approach concerning the 1st side face of this 
invention can be acquired. 

[0052] The object, the description, and advantage of further others of this invention 
will become [ rather than ] clear by detailed explanation based on the example and 
the drawing to attach of this invention mentioned later. 
[0053] 

[Embodiment of the Invention] The information retrieval system concerning this 
invention can be applied to for example, a books sale, can discover the books which 
each customer desires from the crest of the books of a huge number, and can 
recommend RIKOMENDO, i.e., purchasing. 

[0054] In drawing 1 , the procedure in which the information retrieval system 
concerning this invention generates RIKOMENDO information is illustrated. 
[0055] Although it is common in contents analysis type RIKOMENDESHON for a text 
document to use a keyword vector as a description expression of the customer 
profile which is the expression of customer interest, or contents, the same is said of 
the case of the information retrieval system concerning this invention. A keyword 
vector is N dimension vector which has the coordinate value which hung the weighting 
factor on the count of an appearance of each keyword in the inside of contents on N 
dimension space which assigned the axis of coordinates to each of the keyword of N 
individual (above-mentioned). 

[0056] In this information retrieval system, the keyword vector of a customer profile 
is generated using books information (books database) and a customer s purchasing 
hysteresis information (purchasing hysteresis database). Here, the keyword vector of 



a customer profile consists of vectors by which a customer compounds the keyword 
vector of each books purchased in the past according to a category, and is generated. 
Purchasing hysteresis information consists of a customer's member identifier (ID), an 
identifier (ISBNrlnternational Standard BookNunber) of the books purchased in the 
past, etc. 

[0057] On the other hand, a keyword vector is generable for every books by 
performing keyword vector-space configuration processing to each books based on a 
books database. A books database consists of the title of each books, content 
explanation (an epitome/abstract), an author name, a category, etc. 
[0058] And the similarity of the keyword vector of a customer profile and the keyword 
vector of each non-purchased books is calculated. RIKOMENDO [ the high non- 
purchased books of similarity ] to a customer. 

[0059] Subsequently, it explains in detail about creation processing of a customer 
profile. 

[0060] In this invention, the action which discovers the books whose interest the 
customer suited at the actual bookstore assumes following a books purchasing 
behavioral model as shown in drawing 2 . Action until it results in the decision of 
purchase books is divided roughly into two of the actions which choose the action 
which chooses the bookshelf which suited the interest field, and the books which met 
interest using the corresponding information on the back cover of the books of a 
bookshelf, or a table of contents as a graphic display. From the description of each 
action, "classification mold selection action", a call, and the latter can be called 
"retrieval mold selection action" for the former. 

[0061] In the information retrieval system concerning this invention, two kinds of 
customer interest expression systems called the keyword vector used as the books 
category equivalent to classification mold selection action and the criteria of retrieval 
selection action are adopted based on the books purchasing behavioral model shown 

in drawing 2 . 

[0062] You may build from zero for the information retrieval system concerning this 
invention, and a books category can also apply the existing books category. As an 
existing books category, "BISAC Subject Category" which BISAC (Book Industry 
Systems Advisory Committee) offers can be mentioned, for example. BISAC 
SubjectCategory It has classified into 49 kinds of categories, such as Art, Business & 
Economics, Cooking, and Computers. 

[0063] In drawing 3 , it is BISAC. The graph which investigated a customer's number 
of books purchasing categories is shown using Subject Category. Ranging over two or 
more categories, he has purchased books in many cases like the person who 
purchased two or more books, and could understand that there is an inclination for 
the width of face of the interest over common books to be very wide. 
[0064] When performing books sale business on the Internet in which various kinds of 
information retrieval system including WWW (World Wide Web) was substantial, it is 



common that a user, i.e., a customer, accesses a sale site himself, and he searches 
the target books. A books title, an author name, a publishing company, a keyword, etc. 
are used for retrieval. In the case of the customer who is interested in two or more 
categories, what a completely different keyword for every books category to search 
will be properly used for can be imagined easily. 

[0065] Therefore, when creating the customer profile expressing customer interest, 
and the width of face of a customer s interest is wide, it is considered that it is 
necessary to create two or more keyword vectors. 

[0066] Two or more keyword vectors according to purchasing books category 
constituted the customer profile from the information retrieval system concerning this 
invention, assuming a books category to be the unit of customer interest. 
[0067] A keyword vector is used as a concrete customer profile. A keyword vector is 
a vector which has the coordinate value which hung the weighting factor on the count 
of an appearance of each keyword in the inside of contents on N dimension keyword 
space which assigned the axis of coordinates to each of the keyword of N individual 
(above-mentioned). 

[0068] A keyword vector consists of items on a books database, using a title, content 
explanation, an author, and a subcategory (what classified 49 classifications into the 
detail further about 3,700) as an item representing the description of books. Although 
the phrase in the data of a title or the text format of content explanation is used as a 
keyword, you may make it also include the information on an author or a subcategory 
into a vector as one keyword generally. 

[0069] At this example, a customer profile consists of keyword vectors for every 
books category. In drawing 4 , the keyword vector creation procedure of a customer 
profile is illustrated. First, a keyword vector is given for every books. Subsequently, 
when this customer adds together the keyword vector of each books purchased in 
the past for every books category, the keyword vector for every purchasing books 
category of a customer is created. 

[0070] Moreover, to drawing 5 , the creation procedure of the keyword vector for 
every books is illustrated. However, it is necessary to perform a text manipulation 
about the title of books, and content explanation. The content of the text 
manipulation is the following procedure. 

[0071] (1) Give a part of speech to each word contained in a text. As a tool for giving 
a part of speech, it is BrilTs English tagger. [5] is used. 

[0072] (2) Deletion of an unnecessary word deletes parts of speech a noun, an 
adjective, and other than a foreign word using part-of-speech information. Then, the 
unnecessary word list created in advance is applied. 

[0073] (3) Use the word which remained without being deleted as a word keyword. 
[0074] (4) A compound keyword uses the combination as a compound keyword, when 
a noun continued and appears in a text. 

[0075] About an author, a books category, and a subcategory, weighting is performed 



like an above-mentioned word keyword and an above-mentioned compound keyword, 
and it adds to a keyword vector. In this example, the weighting technique which used 
the frequency of occurrence TF of a phrase (Term Frequency) and the reverse 
reference frequency IDF (Inverse Document Frequency) and which is called TFIDF [1] 
is used for the approach of weighting. 

[0076] Let the number of books in which the frequency of occurrence of the j-th 
keyword tj of the word, the compound, author, and subcategory in the i-th books di is 
set to fij, and Keyword tj appears in the number N of complete-works family registers 
be what is set with nj in the following explanation. Moreover, weight set up for every 
classification of a keyword is set to k. In this description, experientially. weight for 
every keyword classification was set to k= 2 about the subcategory, and, other than 
this, was boiled, therefore was set to k= 1 . 

[0077] Moreover, in order to ease the effect of [ when the same keyword appears 
superfluously ], we decided to evaluate the frequency of occurrence of each keyword 
using TFij defined by the formula of the following which took in pair number 
representation, not using the frequency of occurrence fij of the j-th keyword tj in the 
i-th books di as it is. 
[0078] 

[Equation 1] 

|R~ I 

[0079] Moreover, we decided to take in pair number representation and to use IDFj 

defined by the following formulas about reverse reference frequency. 

[0080] 

[Equation 2] 

Ml 

[0081] Moreover, the weight wij given to the keyword tj in Books di serves as a 
degree type using each above-mentioned type [a-one number] and above-mentioned 
[-two number]. 
[0082] 

[Equation 3] 

Fl" I 

[0083] The final keyword vector for every books gives the keyword of 1 2 high orders 
to the descending order of the weight wj of Keyword tj at each books. However, when 
realizing this invention, there is not necessarily no need of setting up the upper limit 
of a keyword, and a upper limit is not limited to 12 pieces, either. 
[0084] As shown in drawing 4 , a customer profile consists of the sums for every 
category of the keyword vector of the purchasing books for every books category, i.e., 
a synthetic vector. Weight wjkl of the keyword tj about Customer's Uk category CI is 



defined like a bottom type. 
[0085] 

[Equation 4] 



[0086] However, in a top type [a-four nunnber], bik sets to 1, when Customer Uk has 
purchased Books di, and it is set to 0 except it. Moreover, sil sets to 1. when Books di 
belong to Category CI, and it is set to 0 except it. 

[0087] Selection of the books [ RIKOMENDO / books / a customer ] is performed by 
calculating the similarity showing the keyword vector of a customer profile, and the 
description of books of a keyword vector. The cosine of the angle between keyword 
vectors to make is used for the count approach of similarity. 

[0088] If the weight of the keyword tj in Customer's Uk category CI is set with wjkl 
and it sets with the weight wij of the keyword tj in Books di, riki will become as a 
degree type whenever [ recommendation / of the books di to Customer's Uk category 
CI ]. 
[0089] 

[Equation 5] 



[0090] Subsequently, RIKOMENDESHON processing of books is explained. 
[0091] Generally in the contents tools of analysis using a feature vector like a 
keyword vector, the description comparison is performed based on the similarity of 
vectors. 

[0092] In this example, the description comparison is performed for every category. 
Two kinds, "order matching type RIKOMENDESHON" which more specifically 
compares the keyword vectors contained in the same books category, and "crossover 
matching type RIKOMENDESHON" which compares the keyword vectors contained in 
a different books category, are performed. 

[0093] Order matching type RIKOMENDESHON procedure is illustrated in drawing 6 . 
[0094] It consists of synthetic vectors of the keyword vector in each books category 
with the past purchasing hysteresis, a customer's interest expression, i.e., customer 
profile. In the example shown in drawing 6 , the keyword vector expressing a customer 
profile is generated in each of the books categories A and B which the customer had 
purchased in the past. 

[0095] On the other hand, the keyword vector of the books contained in each of the 
books categories A and B to which the keyword vector for every books is generated 
(refer to drawing 4 ), among these a customer s purchasing hysteresis was left behind 
is extracted based on a books database. 



[0096] Furthermore, it calculates within the same books categories A and B, a 
comparison, i.e., the similarity, of the keyword vector of a customer profile, and the 
keyword vector of books. What is necessary is just to use the above-mentioned [-five 
number] for count of similarity. And RIKOMENDO [ a customer / a customer ] by 
making a non-purchased thing into a retrieval result among the high books of 
similarity. 

[0097] In order matching type RIKOMENDESHON, only by saying [ that some 
keywords were in agreement ], will be RIKOMENDO the books of the category which 
is completely uninterested, namely, they can prevent unnecessary or useless 
RIKOMENDESHON. 

[0098] Moreover, crossover matching type RIKOMENDESHON procedure is illustrated 

to drawing 7 . 

[0099] It consists of synthetic vectors of the keyword vector in each books category 
with the past purchasing hysteresis, a customer's interest expression, i.e., customer 
profile. In the example shown in drawing 7 , the keyword vector expressing a customer 
profile is generated in each of the books categories A and B which the customer had 
purchased in the past. Moreover, the keyword vector of the books contained in each 
of the books categories A and B to which a customer s purchasing hysteresis was left 
behind is extracted based on a books database (same as the above). 
[0100] Furthermore, the keyword vector of a customer profile is compared with the 
keyword vector of books between different books categories. In the example shown in 
drawing 7 . while comparing the customer profile of the books category A with the 
books keyword vector of the books category B, the customer profile of the books 
category B is compared with the books keyword vector of the books category A. What 
is necessary is just to use the above-mentioned [-five number] for the comparison of 
a vector, i.e., count of similarity. And RIKOMENDO [ a customer / a customer ] by 
making a non-purchased thing into a retrieval result among the high books of 
similarity. 

[0101] According to crossover matching type RIKOMENDESHON, the high 
RIKOMENDO information on unexpected nature is generable by making the relation of 
the books category of a keyword vector and the books description cross. 
[0102] For example, when performing action which looks for the books which have a 
required keyword from the target bookshelf like the books retrieval phase of a books 
purchasing behavioral model shown in drawing 2 , although it is interested from usually, 
it is the case where the books which completely contain the keyword of another field 
are found by chance. Crossover matching type RIKOMENDESHON is the technique of 
aiming at such effectiveness intentionally. 

[0103] As an intelligible example, RIKOMENDESHON to the customer who has 
purchased some books is considered from the books and the Cooking category about 
the category of Science Fiction to Star Trek. According to crossover matching type 
RIKOMENDESHON, the books of a Cooking category can be searched using keyword 



"Star Trek", and books called "Star Trek Cookbook" can be introduced. 

[0104] However, in crossover matching, it is considered when limiting to the books 

category in which had a purchase track record in the past in the books category set 

as the comparative object, and the customer is interested improves 

RIKOMENDESHON precision, are desirable. 

[0105] 

[Example] Hereafter, the example of this invention is explained in detail, referring to a 
drawing. 

[0106] The hardware configuration of the information retrieval system 10 with which 
operation of this invention is presented is typically shown in drawing 8 . Information 
retrieval system 10 is realized also by performing application (it considering as 
"RIKOMENDO application" temporarily in this description) predetermined in a 
common computing system top, although you may design as hardware of dedication. 
An example of a computing system is the PC / AT compatible machine or its 
succeeding machine of U.S. IBM. Hereafter, each part of a system 10 is explained. 
[0107] CPU (Central Processing Unit)1 1 which is the Maine controller of a system 10 
performs various kinds of applications under control of an operating system (OS). OS 
is good at Windows 98 of UNIX (trademark) or U.S. Microsoft Corp./NT. 
[0108] CPU 11 interconnects with other equipments by bus 19 as the graphic display. 
The memory address or I/O Address of a proper is given to each device on a bus 1 9, 
respectively, and device access is possible for CPU1 1 by these addresses. An 
example of a bus 1 9 is a PCI (Peripheral Component Interconnect) bus. 
[0109] RAM (Random Access Memory)12 is an volatile store used since the program 
code performed in CPUl 1 is loaded or the activity data under activation are stored 
temporarily, and usually consists of two or more DRAM (dynamic RAM) chips. 
Moreover, ROM (Read Only Memory)13 is the storage of a non-volatile with which a 
predetermined program code and data were written in everlastingly. On R0M13, the 
self diagnostic program (POST) performed to a power up, the code group for hardware 
input/output operation (BIOS: basic input/output system), etc. are stored. 
[01 10] The display interface 14 is an exclusive controller for processing actually the 
drawing instruction which CPU11 publishes, and is supporting the more desirable bit 
map drawing function equivalent to SVGA (SuperVideo Graphic Array) or XGA 
(extended Graphic Array). Once it is written in a frame buffer (not shown), the screen 
output of the drawing data processed in the display interface 14 is carried out at a 
display 21. Indicating equipments 21 are for example, a CRT (Cathode Ray Tube) 
display, a liquid crystal display display (Liquid Crystal Display), etc. 
[01 1 1] The input device interface 15 is equipment for connecting user input devices, 
such as a keyboard 22 and a mouse 23, to a system 1. The input device interface 15 
answers the coordinate directions input through the key input or mouse 23 by the 
keyboard 22, and generates interruption to CPU11. 

[01 12] The external instrument interface 17 is equipment for connecting external 



devices, such as a hard disk drive (HDD) 24 and the media drive 25, to a system 1. 
jhe external instrument interface 1 7 is based on interface specification, such as IDE 
(Integrated Drive Electronics) and SCSI (Small Computer System Interface). 
[0113] HDD24 is the external storage which carried the magnetic disk as storage 
support fixed (common knowledge), and excels other external storage in points, such 
as memory capacity and a data transfer rate. It calls it "install" to the system of a 
program to place on HDD24 in the condition that a software program can be 
performed. Usually, the executive program code of CPUsI 1, such as an operating 
system, and an application program, a device driver, is stored in HDD24 in un- 
volatilizing. An example of the application installed in HDD24 is the RIKOMENDO 
application concerning this example. Moreover, an example of the data file stored in 
HDD24 is the keyword vector and customer profile which are used for retrieval of 
information contents. 

[01 14] Moreover, the media drive 25 is equipment for loading with portable mold 
media, such as CD (Compact Disc), and MO (Magneto-Optical disc), DVD (Digital 
Versatile Disc), and accessing a dataHogging side. Portable mold media are used in 
order to mainly move backing up a software program, a data file, etc. as data of a 
computer-readable format, and these between systems (a sale, a negotiation, and 
distribution are included). For example, it is possible to distribute data files which 
distribute the RIKOMENDO application which starts this example through such 
portable mold media, or are used for retrieval of information contents, such as a 
keyword vector and a customer profile. 

[01 15] The network interface 16 can connect a system 1 to LAN (Local Area 
Network) according to predetermined communications protocols, such as Ethernet. 
Generally, the network interface 16 is offered with the gestalt of a LAN adapter card, 
and the PCI bus slot on a mother board (not shown) equips with it, and it is used. 
[01 16] On LAN, one or more computing systems which work as a server or a client 
are connected. Moreover, among the computing systems on LAN, a part functions as 
a router and is connected to the wide area network of the exteriors, such as the 
Internet. 

[01 17] On these networks, many computer systems are connected in the transparent 
condition, and the distributed computing environment is built. It becomes unnecessary 
for especially each user to recognize the location of a resource object, such as a 
program and data, under this distributed environment. Moreover, the procedure and 
the method which are performed in a computer are also dispersedly held and managed 
on the network. For example, the process which is operating on one certain computer 
on a network can call and perform procedure of the process which operates on other 
computers. 

[01 18] The servers which work on a network may be a file server, a print server, and 
a database server. Moreover, you may be the WWW (World Wide Web) server which 
exhibits a HTML (Hyper Text Markup Language) document according to a HTTP 



(Hyper Text Transfer Protocol) protocol on the Internet, and the mail server which 
offers e-mail transceiver service. 

[01 19] The information retrieval system 10 mentioned above can be positioned on a 
distributed network as a server which offers RIKOMENDO information to the 
information contents retrieval demand of books RIKOMENDO from a client etc. 
[0120] However, there is not necessarily no need of performing all the treatment 
processes of an information contents retrieval and RIKOMENDESHON sake on the 
single computing system 10. For example, after one certain computing system 
receives an information contents retrieval demand via a WWW server by using the 
structure of a remote procedure call of CGI (Common Gateway Interface) etc., you 
may make it leave activation of retrieval processing to other servers on a network. 
[0121] Moreover, distribution, such as a software program and data contents, is 
performed on a network (common knowledge). For example, it is possible to distribute 
data files, such as a keyword vector for contents retrieval [ **** / distributing the 
RIKOMENDESHON application concerning this example via a network ] and a 
customer profile, via a network. 

[0122] The information retrieval system 10 concerning this example is indicating the 
database used for retrieval of the books as information contents by the list at drawing 
9 . However, it is not necessary to accumulate and manage all of these databases on 
a single computer system, it arranges dispersedly to other one or more database 
servers and file servers on a network, and you may make it access suitably if needed. 
[0123] Subsequently, the information retrieval performed on information retrieval 
system 10 and generation processing of RIKOMENDO information are explained. 
Procedure explained below is realized by performing RIKOMENDESHON application 
concerning this example on the information retrieval system 10 which consists of 
common computer systems. Moreover, in the following explanation, information 
retrieval system 10 shall be applied to retrieval of books and RIKOMENDESHON. 
[0124] The procedure of the contents retrieval concerning this example is roughly 
shown in drawing 10 . 

[0125] Information retrieval system 10 initializes a temporary table first, in order to 
search the books as contents (step SI). In order to register the RIKOMENDO 
information which is the return value of this processing result into the "contents 
keyword table" for registering the frequency of occurrence (or keyword vector) of a 
keyword for every "keyword table" for registering the keyword used for contents 
retrieval, and contents, i.e., books, the "customer profile table" which manages the 
profile of the customer who is the demand origin of contents retrieval, and a list, a 
"recommendation contents table" is contained in a temporary table. 
[0126] Subsequently, creation processing of a contents keyword is performed (step 
S2). Although creation processing of a contents keyword is described in more detail 
"to drawing 1 1 , it is yielded to the after-mentioned for details. 

[0127] Subsequently, creation processing of a customer profile is performed (step S3). 



Although creation processing of a contents keyword is described in more detail to 
drawing 12 , it is yielded to the after-mentioned for details. 

[0128] Subsequently, selection processing of recommendation contents is performed 
(step S4). Selection processing of the recommendation contents in this example 
corresponds to generation of the RIKOMENDO information on books. As mentioned 
above, order matching type RIKOMENDESHON, and crossover matching mold 
RIKOMEN day SHON and both hybrid type RIKOMENDESHON are mentioned to 
books RIKOMENDO. Although order matching type RIKOMENDESHON procedure is 
described to drawing 13 and the procedure of crossover matching mold RIKOMEN day 
SHON is described in detail to drawing 14 and drawing 1 5 . respectively, the after- 
mentioned is yielded for details. 

[0129] In drawing 1 1 , the detail of the procedure of contents keyword creation is 
illustrated in the form of a flow chart. Hereafter, it explains according to this flow 
chart. 

[0130] First, it reads one record at a time from a contents table (step S1 1). And the 
explanatory note of books is decomposed into a keyword and a part of speech is 

given (step SI 2). 

[0131] Subsequently, it is confirmed whether the part of speech is contained in the 
unnecessary part-of-speech list (step SI 3). Subsequently, it is confirmed whether the 
keyword is contained in the unnecessary part-of-speech list (step SI 4). 
[0132] The words (for example, vocabulary of the property to appear frequently 
primarily like an article, "information", and a "system" etc.) which seldom have 
semantics even if it appears frequently in contents, such as a text document, are 
contained in the unnecessary part-of~speech list. The part of speech and keyword 
which are contained in the unnecessary part-of-speech list are excepted from a 
keyword (step SI 5). 

[0133] Subsequently, it is confirmed whether the record which is in agreement with a 
contents keyword table (books keyword table) already exists (step SI 6). When a 
record in agreement is discovered, only 1 carries out the increment of the frequency 
of occurrence of this record (step SI 7). On the other hand, when a record in 
agreement is not found, this record is added to a contents keyword table (step SI 8). 
1 is given as the frequency of occurrence and zero are given to this new record as 
weight. 

[0134] Processing of steps S13-S18 is repeatedly performed to all keywords (step 
SI 9). Moreover, processing of steps S1 1 -SI 9 is repeatedly performed to all contents 
(namely, books) (step S20). 

[0135] Subsequently, record count is counted and read from a contents keyword 
table for every keyword (step S21). 

[0136] Subsequently, all the read records are added to a keyword table (step S22). 
The record number of cases is substituted for contents frequency at this time. 
[01 37] Subsequently, one record is read from a contents keyword table (step S23). 



[0138] Subsequently, the record of the same keyword is read from a keyword table 
(step S24). 

[0139] Subsequently, weight (= frequency-of-occurrence /log (total number of 
contents / contents frequency)) is calculated (step S25). 

[0140] Subsequently, the record with which it corresponds in a contents keyword 

table is updated with the value which the above calculated (step S26). 

[0141] And a contents keyword, i.e., a books keyword, is completed by performing 

processing of steps S23-S26 repeatedly to all contents keywords. 

[0142] Moreover, to drawing 1 2 , the detail of the procedure of customer profile 

creation is illustrated in the form of a flow chart. Hereafter, it explains according to 

this flow chart. 

[0143] First, it reads one record at a time from the purchasing hysteresis table of the 
customer who is RIKOMENDO demand origin (step S31). 

[0144] And the record whose content ID corresponds is read only one out of a 
contents keyword table (step S32). 

[0145] Subsequently, it is confirmed whether Customer ID, Category ID, and the 
record whose keyword corresponds exist in a customer profile table (step S33). 
[0146] When a record in agreement exists, the weight of the record under this 
processing is added to the weight of this coincidence record (step S34). 
[0147] Subsequently, this record is added to a customer profile table (step S35). 
[0148] Processing of the above-mentioned steps S32-S35 is performed to all the 
records whose content ID corresponds (step S36). 

[0149] And processing of steps S31-S36 is performed to all the records in a 
purchasing hysteresis table (step S37). 

[0150] Moreover, to drawing 13 , the detail of order matching type RIKOMENDESHON 
procedure is illustrated in the form of a flow chart. Hereafter, it explains according to 

this flow chart. 

[0151] First, a keyword vector is read from a customer profile for every customer and 
category ID (step S41). 

[0152] Subsequently, one contents whose category ID corresponds are read from a 
contents table (step S42). 

[01 53] Subsequently, the keyword whose content ID corresponds is read from a 

contents keyword table (step S43). 

[0154] And whenever [ recommendation ] is calculated using the keyword vector by 
the side of a customer profile, and the keyword vector by the side of contents (step 
S44). The customer profile is already created according to the procedure shown in 
drawing 12 . 

[0155] Moreover, when the weight of the keyword tj in Customer's Uk category CI is 
set with wjkl and it sets with the weight wij of the keyword tj in Contents (books) di, 
riki is expressed by the above-mentioned formula [a-five number] whenever 
[ recommendation / of the contents di to Customer's Uk category CI ]. However. 



when wjkl is treated as zero when Keyword tj does not exist in Customer's Uk 
category CI, and Keyword tj does not exist in Contents di, wij is treated as zero. 
[0156] Subsequently, a new record is added to a recommendation contents table 
(step S45). 

[0157] Processing of the above-mentioned steps S42-S45 is repeatedly performed to 

all the contents, i.e., books, whose category ID corresponds (step S46). 

[0158] Moreover, processing of the above-mentioned steps S41-S46 is repeatedly 

performed to all the keyword vectors in a customer profile (step S47). 

[0159] Subsequently, whenever [ recommendation ] reads a record from a 

recommendation contents table to high order for every customer (step S48). 

[0160] And it is confirmed whether the record corresponding [ both sides' / Customer 

ID and / of content ID ] exists in a purchasing hysteresis table (step S49). A record in 

agreement is deleted from a recommendation contents table (step S54). 

[0161] Furthermore, it is confirmed whether the record corresponding [ both sides' / 

Customer ID and / of content ID ] exists in a recommendation hysteresis table (step 

550) . A record in agreement is deleted from a recommendation contents table (step 
S54). 

[0162] Subsequently, recommendation contents are counted for every customer (step 

551) . And it is confirmed whether the number of contents to recommend the counted 
number of cases to was exceeded (step S52). If it is less than the number of contents 
to recommend, additional registration of the record concerned will be carried out at a 
recommendation hysteresis table (step S53). On the contrary, when the number of 
contents to recommend has been exceeded, the record concerned is deleted from a 
recommendation contents table (step S54). 

[01 63] Processing of the above-mentioned steps S48-S54 is repeatedly performed to 

all the records in a recommendation contents table (step S55). 

[0164] Moreover, to drawing 14 and drawing 1 5 , the detail of the procedure of 

crossover matching mold RIKOMEN day SHON is illustrated in the form of a flow 

chart. Hereafter, it explains according to these flow charts. 

[0165] First, List A is initialized (step S61). 

[0166] And the record which carries out customer ID is read from a purchasing 
hysteresis table beginning (step S62) to read one record from a customer table (step 
S63). 

[0167] Subsequently, the list of categories ID with purchasing hysteresis is created, 
and it writes in List A (step S64). 

[01 68] Subsequently, the record corresponding [ Customer's ID ] is read for every 
category ID out of a customer pull file (step S65). 

[01 69] Subsequently, one contents which fulfill the following conditions 1 are read 
from a contents table (step S66). 

[0170] Conditions 1: Unlike the category ID of a customer profile (a category crosses), 
Category ID should be contained Category ID in List A. 



[0171] Subsequently, the keyword vector whose content ID corresponds is read out 
of a contents keyword table (step S67). The customer profile is already created 
according to the procedure shown in drawing 12 . 

[0172] Subsequently, whenever [ recomnnendation ] is calculated using the keyword 
vector by the side of a customer profile, and the keyword vector by the side of 
contents (step S68). The customer profile is already created according to the 
procedure shown in drawing 12 . 

[0173] Moreover, when the weight of the keyword tj in Customer s Uk category CI is 
set with wjkl and it sets with the weight wu of the keyword tJ in Contents (books) di, 
riki is expressed by the above-mentioned formula [a-five number] whenever 
[ recommendation / of the contents di to Customers Uk category CI ]. However, 
when wjkl is treated as zero when Keyword tj does not exist in Customer's Uk 
category CI, and Keyword 1;i does not exist in Contents di, wij is treated as zero. 
[0174] Subsequently, it is confirmed whether the record which corresponds with 
Customer ID exists in a recommendation contents table (step S69). When a record in 
agreement does not exist, additional registration of the record concerned is carried 
out at a recommendation contents table (step S72). 

[0175] Moreover, when the record which corresponds with Customer ID exists in a 

recommendation contents table, it is confirmed whether it is still higher than 

whenever [ recommendation / which whenever / recommendation / of the record 

concerned / calculated at step S68 ] (step S70). In being higher than whenever 

[ recommendation / which was calculated ], it updates whenever [ on the record 

concerned / recommendation ] with the value of whenever [ recommendation / which 

was calculated ] (step S71). Moreover, in not being higher than whenever 

[ recommendation / which was calculated ], it skips step S71. 

[0176] Category ID performs processing of the above-mentioned steps S66-S72 

repeatedly to all the contents corresponding to conditions 1 (step S73). 

[0177] Moreover, processing of the above-^mentioned steps S65--S73 is repeatedly 

performed to all the customer profile keyword vectors of the corresponding customer 

(step S74). 

[0178] Moreover, processing of steps S61-S74 is repeatedly performed to all the 
customers in a customer table (step S75). 

[0179] Subsequently, whenever [ recommendation ] reads one at a time in high order 

for every customer out of a recommendation contents table (step S76). 

[0180] And it is confirmed whether the record corresponding [ both sides' / Customer 

ID and / of content ID ] exists in a purchasing hysteresis table (step S77). A record in 

agreement is deleted from a recommendation contents table (step S82). 

[0181] Furthermore, it is confirmed whether the record corresponding [ both sides' / 

Customer ID and / of content ID ] exists in a recommendation hysteresis table (step 

S78). A record in agreement is deleted from a recommendation contents table (step 

S82). 



[0182] Subsequently, recommendation contents are counted for every customer (step 
S79). And it is confirmed whether the number of contents to recommend the counted 
number of cases to was exceeded (step S80). If it is less than the number of contents 
to recommend, additional registration of the record concerned will be carried out at a 
recommendation hysteresis table (step S81). On the contrary, when the number of 
contents to recommend has been exceeded, the record concerned is deleted from a 
recommendation contents table (step S82). 

[0183] Processing of the above-mentioned steps S76-S82 is repeatedly performed to 
all the records in a recommendation contents table (step S83). 
[0184] «comment» 

[1]:G. Salton and C. Buckley, Term-weighting approaches-in-automatic text retrieval.. 
Information Processing-and-Management, 1988, Vol.14, No.5, P.51 3-523[2]:Marko 
Balabanovic, and Yoav Shoham, Content-Based, Collaborative Recommendation., 
Commnications of the ACM, 1997, and Vol. 40, No.3, p66-72 [3] :P aul Resnick and 
Neophytos lacovou, Mitesh Suchak and etal., GroupLends:An Open Architecture for 
Collaborative Filtering of Netnews.Proceedings of the Conference on Conputer 
Supported Cooperative Work and 1994, pi 75 -186 [4] : NetPerceptionsInc, 
Recommendation Engine White-Paper.http: 

//www.netperceptions.com/literature/content/recommendation.pdf, 2000 [5] : Eric 
Brill, http://Rule Based Tagger and www.es jhu.edu/"brill/ home.html, 2000 [0185] It 
has explained in detail about this invention, referring to a specific example more than 
[addenda]. However, it is obvious that this contractor can accomplish correction and 
substitution of this example in the range which does not deviate from the summary of 
this invention. 

[01 86] At this example, although explained based on the books sale, even if it is the 
case where the articles and data contents other than retrieval of other goods or 
goods are discovered, it cannot be overemphasized that this invention can be realized 
similarly. Moreover, it is not necessarily limited to the contents retrieval on wide area 
networks, such as the Internet. For example, on the computer system of a stand- 
alone, even if it is the case where a contents search service is searched on a 
dialogue screen, it can do [ applying this invention similarly or ]. 

[0187] In short, with the gestalt of instantiation, this invention has been indicated and 
it should not be interpreted restrictively. In order to judge the summary of this 
invention, the column of the claim indicated at the beginning should be taken into 
consideration. 
[0188] 

[Effect of the Invention] As a full account was given above, according to this 
invention, the outstanding information retrieval system or the outstanding approach of 
RIKOMENDO [ the contents corresponding to a demand of a user / discover and ] 
out of huge information contents can be offered. 

[0189] Moreover, according to this invention, based on user profile information, such 



as a hobby and taste of a user, and past retrieval hysteresis, purchasing hysteresis, 
the outstanding information retrieval system or the outstanding approach of desired 
contents that it can search and RIKOMENDO can be performed can be offered. 
[0190] Moreover, according to this invention, when performing business, such as a 
goods sale, on the Internet, the outstanding information retrieval system or the 
outstanding approach of RIKOMENDO [ the goods which a customer expects / 
discover and ] can be offered. 

[0191] Moreover, according to this invention, the outstanding information retrieval 
system or the outstanding approach of RIKOMENDO [ can discover goods with which 
a customer asks for the unexpected nature other than benefit like a books sale, and / 
a customer ] can be offered. 

[0192] Although the information retrieval system and the approach concerning this 
invention make a contents analysis method the keynote, they can cancel the fault of 
a contents analysis method by carrying out the device which expands the width of 
face of RIKOMENDO information. Moreover, also when customer magnitude is small, 
offer of high quality RIKOMENDO service can be enabled because the number of 
object contents applies this invention in a very big type of industry like a books sale. 



DESCRIPTION OF DRAWINGS 



[Brief Description of the Drawings] 

[Drawing 1] It is drawing having shown the procedure in which the information 
retrieval system concerning this invention generated RIKOMENDO information. 
[Drawing 2] It is drawing having shown the books purchasing behavioral model when 
discovering the books whose interest the customer suited at the actual bookstore 
assumed in this example. 

[Drawing 3] BISAC It is the graph which investigated a customer s number of books 

purchasing categories using Subject Category. 

[Drawing 4] It is drawing having shown the keyword vector creation procedure of a 

customer profile. 

[Drawing 5] It is drawing having shown the creation procedure of the keyword vector 
for every books. 

[Drawing 6] It is drawing having shown order matching type RIKOMENDESHON 
procedure. 

[Drawing 7] It is drawing having shown the procedure of crossover matching mold 
RIKOMEN day SHON. 

[Drawing 8] It is drawing having shown typically the hardware configuration of the 
information retrieval system 10 with which operation of this invention is presented. 



[Drawing 9] The information retrieval system 10 concerning this example is drawing 
which indicated the database used for retrieval of the books as information contents 
by the list. 

[Drawing 10] It is the flow chart which showed the procedure of the contents retrieval 
concerning this example roughly, 

[Drawing 1 1] It is the flow chart which showed the procedure of contents keyword 
creation to the detail. 

[Drawing 12] It is the flow chart which showed the procedure of customer profile 
creation to the detail. 

[Drawing 1 3] It is the flow chart which showed order matching type 
RIKOMENDESHON procedure to the detail. 

[Drawing 14] It is the flow chart which showed the procedure (first half) of crossover 
matching mold RIKOMEN day SHON to the detail. 

[Drawing 1 5] It is the flow chart which showed the procedure (second halO of 
crossover matching mold RIKOMEN day SHON to the detail. 
[Description of Notations] 
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ROM 
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Display interface 
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Input device interface 
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Network interface 
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External instrument interface 
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Bus 
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Display 
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Keyboard 
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Mouse 
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Hard disk equipment 
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Media drive 
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egory Arts Business &; EconomicSv Cooking^ Co 

mputers:&if04 9a3S(D^7^=^U(C»«tTU^^o 
[0 0 6 3] DB^Cf*. BISAC Subject Category^ffi 

[0 0 6 4] www (Worldwide Web) 

[0 0 6 5] Lfc3b^oT. M^Rtt^^ll-^llS^P 

[0 0 6 6] :^mm\m^mm^mi^:^'Tixi:\t. mm 

[0 0 6 7] m#:M^5:iiS:/p:7z^;u<i:tTt*> 4^- 

[0 0 6 8] s»x-^^-x±co3sa^^6. mmom 
[0 0 6 9] *^]i0im*. mm':fayT^Mt. mm 



[0 0 7 0] ^tc. msK^t. mm%(D^-'7-\^^ 

^ h^l/OflF^#|li5&Elj»LTt>^o fits h 

[0071] (Dxrtixh Jc^^ti^=&*igl2:p^PI^ 
jvf^-r^o p^ii:&'ft^t"^fca6cD*y-/l/<!:LT(*s Bril 
Ts English tagger M^fflt^^o 

[0 0 7 2] (2) -^m^mimt. smmm^fs^^x 
[0073] ( 3 ) %mitvt\zm^ tcwm^^m^- 

y- Ki:LTfUffl"r^o 

[0 0 7 4] (4) mm%^-'j- Y\t. sca^tesii 

[0 0 7 5] mm. SS:<37^dU. t^■::^:^?T^dUtc^t^ 

(Term Frequency) t,^ScMM& I DF (Inverse Docu 
ment Frequency) ^ffll^fc> TP I DPHl^WSn^ 

[0 0 7 6] l.XTa)i«0^T«x iSaeDS^ditC^nt 

^mii • li^ii • mm • tf::?^ti5^d uo) j sa(;54^-'7 
- Ft jCDaiil«^^f ij<h:U ^fc. ^SSISNcD* 

^fc. 4^-'7-F<D«Symte^L/cfi^^k^f 
[0 0 7 7] $fc. cD4=—^-F;b^il«^eaiiILfc 

tt^ j «acD=^-r7- F t i<Dtiimmm. f i j^^^d^^ 

[0 0 7 8] 

riv=iog2(/,*+i) 

[0 0 7 9] ^/c. aixmsg^egiLTi*. nmmm^ 

ttLtco 
[0 0 8 0] 

mil 

IDF^^log,iN/n^) 
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[0 0 8 1] ^/cx *»d iii:*5tt^4^-'7-Kt jiC^xT [0 0 8 2] 

= TF^ X IDF^ - log, (/, • A: + 1) • log, (N/n^) 



[0 0 8 3] mm^t^mmmo^-'y- k • h^u 

[0 0 8 4] El4twxv'rj:atc. mmzfnyT^fMts 
«]v -r^fc-B^t^xd u«(D^^a:>7 h/bTti^^rn^o 

[0 0 8 5] 

[gi[4] 

[0 0 8 6] mu ±^ [iJf4] t^fcU^T. bik^is H 
SUkJb^SSd i^MALTt^/cii^^^ 1 ^ L. ttilX9\' 

[0 0 8 7] iimcuriy>K-r^»»<?>3iM*s mm 

[0 0 8 8] iISUkC7):^xdUCjtuJ$tt^4=--7-F 

[0 0 8 9] 
[fSS] 

[0 0 9 0] a VftlSlu 

[0 0 9 1] =^-'7-H h;U<^cfcd^#^^^ h 

[0 0 9 2] *^Sg#im*. :^}5=-dg«£:§tmS5=g'?T 
[0 0 9 3] Ill6icli. jli^^y^>^l^U3J?<>5r"->' 



[0 0 9 4] mm<Dmmmmrrst>'hmmynyT'<}\^ 

[0 0 9 5] m-M. 

[0 0 9 6] l^e^C. p]i;S:S:^7^d U AXtiBrtT\ 

mi^^t Lzm^iz u □ ;>< V K-r^o 

[0 0 9 7] lii^*y^><^^^SUZ!^>7^-->'3>Ti*> 

[0 0 9 8] ^fc. ^7tc^*. ^m-^^y^y^m^oi:^ 
[0 0 9 9] mmo)m^mmtr^t^'^mm':fnyr^f\/ 
•7- K • h/uo^^^^ h/i/T*i«^ti^o miiz 
dU AJS^t>B(DS'^^c:fct^T. mmynyy^ii^^mm 

d u MmBo^M^z^t^Ki^mmco^-y- k • 
h;i/=&ttiti-r^ (itj±) o 

[0 10 0] ^Stc. M^^»S:ti7^dU<DP^T\ il« 
T^dU ACDii$'/a77''</l/<h»^;^7^dU bcd»^4^ 

[0 10 1] 35M"^*y^V'yMU=i^Vx->'a V^Ccfc 
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[0 10 2] mxitt. mziZTfkLTcmmmmnm^y'ii^ 

[0 10 3] ^ti'Vmi^mtLTiits Science Fiction 

CDtFT^d U^^SStar Trek II t"^SS<h Cook ing;^7"d^ 

^ y > a > cfc ti its Cook i ng :^ 7^ d U + 
""■'7-rStar Trek^^fflC^T^^U "Star Trek Cook 
book'' i: t^ o tcmm^mftt ^Ct 3b^T * ^ o 
[0 10 4] fit. 3:sM"^^y^>^^^^:^t^T^*. ^no 

[0 10 5] 

mmt^o 

[0 10 6] ^8tcti. :$:mmcDmmizm-iti^mnm 
ifis«^i^x7^iAi ot*. mm(D/\-\^'y j^rmm 

rgzi^vK-Z':/';>5r-i/3Vj <i:f^) ^Mt-T^ 
iXTv -vxt^ai 0<D#g|5tJ:ot^Ti«9^-r^o 

[0 10 7] i/XT^Zxl 0(D^-(y • hO-^TaS 
^CPU (Central Processing Unit) 1 1 tt. :t^U 

-T-'iy'f^'iy'T.'TU (OS) <D»TT\ #acDZ:/ 

ttOW i n d ows 9 8/NTT<fct\ 
[0 10 8] m^O^iy. CPU 1 1 /^X1 9tCci: 

KuxjbMxr^-dftiT^u. CPU 1 1 (scnezHux 

lcJ:-pT3^SZ^-feX*i^RrtEi:«:oTt^^. /^X1 90) 
— fllJ^PC I (Peripheral Component Interconnect) 

[0 10 9] RAM (Random Access Memory) 1 2t*x 

CPU1 ncfci>Tmf^n^:/P'^^^/xo- K^p 



DRAM (dynamic RAM) ^'^y :?^T«^;rtl^o ^fcx 
ROM (Read Only Memory) 1 3t*s F/tSO>^P'^5L. 

It^BT^^^c ROM1 3±ltt*x <5y^^*. SaiSABt 
tC^fif^gBi^ifT^P^^/^ (POST) /\-K 
'^x7'Am;^^#^cr)P-Ki¥ (BIOS : S^A*:^ 

[0 110] xVx:/u-r • ^>^--7x-x i 4^^. 
C P U 1 1 ^W^f ^fillii#^^lll^tJ:«lST^/ci6<^ 
«fflP>hP-^T»y. cfcU»^L<t*SVGA (Su 
perVideo Graphic Array) Xt*XGA (extended Graph 
ic Array) ffl^cD^^y h^^y :/Jaiil«tg^^t/K- h LT 

^L^U^) ^C-M#^jZ.^*ifdt. S^x^S2UtiSffi 
{jB:^3^n^o a^^S2U*. mX^t. CRT (Cathod 
e Ray Tube) ir -< T^^Ky^^. ?Sal^o^vxV X ^J^'U-f 

(Liquid Crystal Display) ^^^e^^o 

[0 111] A:^«lil<V-S?->^x-"X1 5§*. ^-if- 
- K2 2-^V'i7Z2 3^^<DIl-+f A:^^^^i^XT^A 

-XI s\t. ^-t^-Yz2\zi^^^-xtmt-^^7. 

2 3 ^itX^tcmm^mXMZf&^L.X. C P U 1 He« 

LTiijtia.^^^^t-^0 

[0 112] ^^wmm^y^-y 7t*. /\~k 

•t'VX^' K^-f":::^ (HDD) 2 4'¥»>^xV7^' K^-T 

-72 5r^E(o^w^mm^'y7.T■h.^{zmmt^tzisb<Dm 
aT3fe^o «i^gi5as^v-$^-:7x-xi 7(*. 9mt. 

I DE (Integrated Drive Electronics) -^SCS I 
(Small Computer System Interface) :^<!fcDO^~- 

yj:-:^mmzmmt^o 

[0 113] HDD2 4(*s mmmwti.x(om%=r^ 
'T.^^m'^mizmmi.tc^m^mmm.i:^^) (Mm . 

cfcu^^tiTt^^o y:7 h'^xT' • ^^p^-^ix^^tfpj 

fg^t^HT H D D 2 4±^CS< ^^-I/P f'yL.CDz/X 
xiA^cD r^>Xh-;bJ <hP?^$^o MMx HDD2 4tc 

:/p>J^^ix. x/KX' K^</^^^£CDC P U 1 KDH 
S7^P^5/x'P-H3b^^S^Wlt^§*ft7*-tlTt^^o H 
D D 2 4tc-rvX h-;U^tl^7^-/U^-~>3 

2|s:^]^fytc«;SU=]p<VF • T^T/U^-i/aVT:* 

• h;b-^iI§:/p:7 7'-r;i/Tfe^o 

[0 114] ^fc. ^xVT^' K^^:?2 5ti. CD (C 
ompact Disc) -^MO (Magneto-Optical disc) ^ DV 
D (Digital Versat i le Disc) fe^cDRJig^y xV 
^aUT. x-^IB^ffitC7'<7HzX"r^/cA6CD^ST^^ 
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[0 115] toy h'7"-'^'-r>^-:7x-'X1 6{t. 
Ethernet ^ <i: <DF/f S<DiiM:^P h^il/ttSfo 
Ts i^XT^/xl AN (Local Area Network) ^Cjgfl 
r^Ctti^T^^o h'7-^'<>^-:7x-X1 

[0 116] LAN±ta*v v—j%y.it<7^^Tyht 

[0 117] eitlST^y h'7~<7±Tt*. ^IKOi+S^ 

[0 118] :toy vj-"? ±.zmmt^n-}%\t. mx 

X • +f-/\T2^'pTt.<fcU\ $fc. -f >^-#oy h±"e 
HTTP (Hyper Text Transfer Protocol) h^/U 
itt^^oTHTML (Hyper Text Markup Language) K4=- 
iL;><>h^^Bit"^WWW (Worldwide Web) ^-1% 

[0 119] jc^i.tdmw^m'iyT^T'U^ o\t. "^w.^-^ 

[0 12 0] ffiU IfSB^VT^V^y^^SttfUHP^Vx 

xxAi o±Tii^T'r^f£:v^t*ie^rt^^t\ 

fe^ 1 '^Otfeai>'XxA3b\ CGI (Common Gateway 

I nterf ace) ts WtH L(Dttffl^^*iJffl 



[0 12 1] ^fc. #.»y h'7-'5'±Ttis y:7h':7x7' 
• :/p^^iA-¥>x-'S? • =i>x>*y&<!:(OxVXhUt:^ 

K • h;i/-¥»ll«:^p:7 7'^/i/^^ox-^ • 377' -< 

/U^#oy h'7-^gftTiB<l"r^e:^*i^pTtgT^^^o 

[0 12 2] m9\,z\t. ^^mm\,z%^mmmmzy7.'T 
/xi o*f\ mm::^yi^>'yt\^T(ommo:>mM<orcmz 

cdt'-'J^^-x • ^-i%^yT^}\/ • -y—z^fc^^iRLT 

iBMLT. 4e;>MtCI^;i;T3il:y^-feX'r^<l:3tcLTt> 
[0 12 3] ::^tNT\ If fg^t^v^XT^Zx 1 0±TII^t:^ 

T\ *^Jft«c<l^U3^yx-i/3>-7^:/U'^--v' 
fix IffS^t^i/Xv^AI 0(*«S(D^^;S^tfU=i>«>x 

[0 12 4] HI otcf*. *:*ii«yte«^=3VT=-y*y«i 

[0 12 5] 1f Si^^i/Xf^A 1 Ot*. □VxV^ytt 

- K • ^-5^ h/U) *a®'r^fca6<7:) rz]>x>^y • 

[0 12 6] ::^t^T\ z}>y^y^y • 4^-9- KcD^tmsa 
(Xx*yys2) o ziVT^v^y • ^-9- K<^ 
#^$Q^(*v mi nc5-6tc:pL<fB3SLTLN^3b\ m 

[0 12 7] m^yayT-oi^oj^miBim^n 

mit. m^ 2tc*6iCPt<fBaLTt^^6\ i¥3fflt*m 

[0 12 8] J»lir]>T^>^y<DS»^«lS^?T5 

(X7^'y:/S4) e *:^ffl#^Jt2:*5tt^»M^VT^>^>0 
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ii#|!i^^:ot^T^iE1 1 3tc. =gM*7^y^>^Ui]p«> 

[0 13 0] zi>xv*y • T'-:?;i/3b^en3-"K 
^lo-roM^ffi-r (XT^^y^SI 1) o ^LTs 

*y:/S 1 2) o 

[0 13 1] D°piil3b^^^p"piiUXhtc^^tiT 

t^^3b^S3b^*f^X*y^t"^ iXT-yySl 3) o 

^^x^y'j7-r^ (X7^^y:/S 1 4) o 
[0 13 2] :^Sp°pilUX htctis h • K=^iL^ 

5) . 

[0 1 3 3] □ VT^v^y • =^-'7- K • x^-:?^ 

;l/ «S4^-'7-K-7^~7^VU) tc-Sff^U^-K/? 
ejic#at"^*^S*^^^x^y^f ^ Ux*y >^S1 

6) o -S[-r^un-K5b^^m**x/c3i^tct*. i^uzi 

- HcoaiiM®^ 1 ratJi^^f ^ (X7^^y:/s i 7) . 

fiDt"^ (XT^'yT^Sl 8) o CCDff*IU3-Kttt*. Hi 
[0 13 4] XT-^ys 1 3-S 1 8(t>iQS^. r^T 

o:^-^-\^izMLrmvmLmn-r^ Uxy:/si 
9) o XT^^y 7^5 1 I'-si 9cD$aii^v -r-^T 

^ (Xxy:^S2 0) o 

[0 13 5] =3>5'>^y • =^-7~ K • 5^-:^^ 

ffif U7^^yyS2 1) . 

[0 13 6] ::5?:i^T\ ii^aBLfct"^T^DUZ3-K^=^ 

-~7-K-x-:::?Vi/ic3lfial-^ (XT^'y:/S2 2) . c 

[0 1 3 7] I3>7^>*y • 4^—7— K • 7^—7^^ 

iU;t)^6U=3-K^1 oii^tH-r Ux*y7^S2 3) o 

[0 13 8] =^-9- K • 7^-:^;l/3t)^6Pl— + 

-7-K<^U3-K^SI^aif (Xx'y:rS2 4) o 

[0 13 9] «^ (=aiilSS-Mog (^HV 



25) o 

[0 14 0] ziVT^y^y • =^—7— K • 'r—'J 

(Xx^y >^S2 6) o 
[0 14 1] ^UT. XT^^y:^S2 3-'S2 6c7)$aS^ 

t^^t\,Z^^). ZlVxV^y • =lF-'7-F-r»fc'SS» 

=^-7- FTb'^^lS-r^o 
[0 14 2] ^fc. SI 2tz:t*. ilSyP7 7'-</l/#^ 

[0 14 3] ^1t\ UPP^VF^^TcT^S^P^O^m 

^M5^-::^vu*^e UP" F^ 1 -D-roM^tar (XT^^y 

■/S3 1) o 

[0 14 4] ^LT. P • 41-7- F • 

;i/flD«)b^e> p>5Fv^y I D^^-Sf-r^up-F^i-^ 
mm^^-t (XT^^y:/S3 2) o 
[0 14 5] m^zfuyr^ii • x-y;U* 

f ^UP-F«^??ffit-^A^S3C)^^5^x^y>>-r^ (XT^^y 
7^5 3 3) o 

[0 14 6] — ifct-^UP- F3b^#ffi-r§ii-&tC«. C 
SI^UP - FcOM^itlSSaS^O^UP- Fa>«^^SP 
St"^ (X5F»y^S3 4) o 
[0 14 7] :*t^T\ liS::^P7 7'>r;l/ • T—yMzz. 

a)UP- F^oiftO-r^ (Xx^yyS3 5) o 
[0 14 8] ±IBCDX7^'yyS3 2'-S3 5CDSa^^x 

PVxV^y i DA>— ifc-r^T^TOUP-Ftc^LTH 

fit"^ {TsT-^yS 3 6) o 
[0 14 9] ^LT. XT^^y 3 1 --S 3 ecD^lOiS 
ifMMM7^"::^VI/45<^t"^TCDUP-Ftz:*fLTll 

ffT^ (Xx^yT^S 3 7) o 
[0 15 0] $fc. HI 3ta*. |ii-=7*y^>^MUP^ 

[0 15 1] ^-r. il«7^P7r-<;U:b^6iiiSx tiT-:S 
u I Dme4^-'7~ F • h/l/^-^^j^ffi-r (Xxy :/ 

S4 1) o 

[0 15 2] pv7^v*y • f"— y;UA^6^x^rr 
g I D3(}^-sfc-r^pv7^v*y^i o^^ffi-r (xt^^t^ 

S42) o 

[0 1 5 3] :j^t^T\ PV-T^V^y • 4^—7— F • 
iS/t^i^nyT-y^y 1 Dt^^— a*r^4^-7-F^S6^ait" 
(Xx'y7^S43) o 

[0 15 4] ^LTx ii§7^P7 7''<;i/ffliJ<:D4^-7--"F 
•>^:<7h;U<k, P>7^>^yflycD4^--7-F-^'^ h/i/^ 

mi^Xs mmm.mmt^ (XT^^y7^s4 4) o n^:^ 
uyy^Mt. m^ 2\z7j<'twmmm\.zU'oTmzwm. 
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[0 15 5] ^Tc. liSUk<^^7^dfUC|tc:fett^4^- 

•7-Kt j^DM^^w' jki<?:*5#. =3>"x>*y mm) 

d liz^^f^^-'y- K t jCOm^wi j tlol-^Tct^. m 

l*w' jkl^-lfPi:LT»t\ ^fc. ri>T^>^ydilc4^ 
do 

[0 15 6] 3S(;t^T\ *tii=i>7^>^> • "T-yMzmm 

[0 15 7] ±m<DX7'^y':fS A 4 5 (Diim^. 

[0 15 8] ^fc. ±iB<DX5F^y3^S4 1-'S4 6<;)«1 

[0 15 9] :Jfet^T\ jfliuvxv^y • T^-^^/l/^^S. 
3^548) o 

[0 16 0] ^LT. mmmmy'-yii^^itmm \ or 

[0161] i^-etCx *tliMM7'--:::^Vb*teiiS i 
3b^^;b^^^x'y^7f ^ (XT^^y^SSO) o —mf^U 

5F'yys 54) o 

[0 16 2] mm:iyf->y^mssizti'yy 

^ (7.T^^y3^S5 2) . ^tliLfct^□V5^V^>aJ.Xl*lT 

»:tit*\ mmmMT—yMzmmiy::^- \'^^mmmmr 

^ (Xx»y:/S5 3) o jfi^c. }tMtfci^=3>7^>*y3S 
v^y • x-://U3b^6iyi«'r^ Uxy:/S5 4) . 

[0 16 3] ±iB<DX7^'y:^S4 8-'S 5 4CD^S^. 

T3»y51Lllff-r^ (X7^y:/S5 5) . 
[0 16 4] Ell 4JS^tf!l|1 SiC^*. 5SM^^y^ 

[0 16 5] vx\-A^mmitr^ ixy'yys 

6 1) o 

[0 16 6] ^LT. ii§7^->^;l/?b^SU=i-K^io 
m^mt U7=-^yyS6 2) <t<!:^ic. MMMMt^-:^^ 



;i/;b^6ilSl p-r^Uzi- K^fE^mf U7^^y:/S6 

3) o 

[0 16 7] :^t^T\ MMMM<^^^:^F7=-dU I D(^»U 

4) o 

[0 16 8] li§"//i/37 7-'</KD*^^6. MS 

[0 16 9] 13 VT^v^y • 7^--:::^/U*^Si'XTfiD 

^#i^«fcT=i>T^>^y^ioii^tii-r (T^T-^ys 

6 6) o 

[0 17 0] ^#1 : t?7"dU I D*i^iiS^p:7 7'^/i/ 

[0171] :ki^T\ 3>T^v*y • ::^^-'7-~ K • t^—^ 

^h;u^ii^ffi-r U7^*y:^S6 7) o m^yuyy^ 
Mt. HI i^zTj^rimmm.zu^Tmz^mi^tiTK.^ 

[0 17 2] iis:rp-7 7'-r/HW<;)=^-'7-K 

ffit^T. itiiit^itm^^ U7^^y>^s6 8) o rn^y 

uyy^Mt. mi 2ltn^r«iS#McSe'PTftte:f^^ 

[0 17 3] $fcx liSUkC>:^i7^dUCitcfctt^*- 
•7-Kt jOa^^w'jkl<!:^#. zi>7^>*y 
d itcait^4^-'7- K t jC7)M^ws ji:^t^fc<i:#^ H 
SUkO:tr7=-dUC|tcS4-r^3>7^>'yd iCD«l*^r 
ikitix ±IB<DSC [iS5] tCcfcoXaS-n^o fflU M 
m\})f^(Di3i^^^J C\\,z^-r?- H t j3b^#?5L^t^it^ 
t*w' jkl^-ifP<!:LTS^l\ *fc. Z3>T^>^y d 
-■7- Ft j*^#aL^t^i•^^*wi j3&-if LTffi 

do 

[0 17 4] ;^L^T^ mm^yT-yy • 7^-::?;ua)* 

tc. US I D^PVxV^y I D(D)9^:&*f^"-S(f ^Ua- 
f^'y y^S 7 2) . 

[0 17 5] ^fc. JiliuVT^v^y • x-:?;l/CD43ti:s 
w^r^m^iz^t. -^^siz. mmiy::i-F<Dimmt'^x 

y'yys 6 8Ti+St/c*ili/gJ:Ut>lll^^^^3b^=&^x 
^y^t-^ (X7^'y7^S7 0) o itSLfctSli^cfcy^S 

JtcDffiTMSff ^ U7=-'y:/S7 1) o ^fc. it^L/c 

[0 17 6] ±m0Xf'^yyS 6 6-^5 7 2<DSaS^. 



(14) 



mm 2001-265808 



[0 17 7] mtc. ±iB<0X7^^y3^s6 5-s7 3a)ia 

S74) o 

[0 17 8] mtc. X'r^yyS 6 1-57 4(D«m^. 

[0 17 9] ^ki^i:\ jtiii3 vx>^y • r—yivo^t^ 
[0 18 0] ^LT. ^HMM5^-::?Vi/*tcii§ i d;^^ 

[0181] «liMMf^-':?/u*tciiS i d;s^ 

[0 18 2] itig=iv-r>^y^li««j::^'i'V 

^ UT^y:/S8 0) o jtlitfct^nvxv^yiiciXrtT^ 
^ (X7=-'y>^S8 1) o Jt«tfct^=i>7^>*yiS 
>^y • 7^-:?;l/3b^6iJI^T^ (X7^^yys8 2) o 

[0 1 8 31 ±§^(0X7^^^73 7 6--SS2(Dmm^. 

T^t>31Lllff*r^ (Xx*y3^S8 3) o 
[0 18 4] «>t3K> 

[1] : G. Salton and C. Buckley, Term-weighting appr 
caches in automatic text retrieval.. Information P 
recessing and Management, 1988, Vol. 14, No. 5, P. 51 
3-523 

[2] : Marko Balabanovic, and Yoav Shoham, Content-B 
ased. Collaborative Recommendation., Commni cat ions 
of the ACM, 1997, Vol. 40, No, 3, p66-72 
[3] :Paul Resnick, Neophytos lacovou, Mitesh Sucha 
k, et al., GroupLends:An Open Architecture for Col 
laborative Filtering of Netnews. Proceedings of th 
e Conference on Conputer Supported Cooperative Wor 
k, 1994, p175-186 

[4] : NetPerceptions Inc., Recommendation Engine Wh 
ite Paper, http://www.netperceptions.com/literatur 
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